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(57) Two-dinnensional barcodes, each having en- 
coded digital infomnation in a bitmap representing pref- 
erably randonnized encoded data bits, are printed onto 
a printed mediunn. Preferably, error correction codes are 
added to the digital information to ensure that the de- 
coding process accurately reproduce the digital infor- 
mation. In one embodiment, the bitmap may further in- 
clude "anchor" bits in each comer, which are used as 
part of the skew estimation and deskewing processes 
during decoding. In a second embodiment, no "anchor" 
bits are required. The encoded digital information is 
mapped into the two-dimensional barcode in such a way 
as to minimize the errors caused by damage to particu- 
lar rows and/or columns, for example, row damage 
caused by faxing the printed barcode. To extract the en- 
coded digital information from the printed medium, the 
printed medium is scanned, then the bitmap is located 
within the printed medium. The skew of the bitmap, if 
any, is determined, and the bitmap is deskewed if nec- 
essary. The bitmap is then cropped, and the randomized 
digital information is read from the bitmap. The digital 
information is derandomized and any error correction 
codes are removed, in the process correcting and/or re- 
cording any errors discovered, thereby reproducing the 
original encoded digital information. 



FIG. 8A 

Scanned B&W Bitmap 



Locate 



-70 
Fail 



Error-no 
Pana Mark 
Detected 



Estimate Skew 
Angle 0 
(Hough Transform) 



0<^<a 




-7) 



Trigonometric 
Deskew 



Crop 



74 



"75 
Fail 



Read 



-76 
Fail 



Error-no 
Pana Mark 
Detected 



£rror-no 
Pana Mark 
Detected 



NxM Bit Array 



Printsd by Jouve, 75001 PARIS (FR) 



1 



EP 0 962 883 A2 



2 



Description 

Field of the Invention 

[0001] The invention relates generally to two-dimen- 
sional barcodes, and, more particularly, to a robust clock 
tree two-dimensional barcode symbology, encoding in- 
formation formatted in such barcode symbology, print- 
ing the barcode symbology onto a printed medium, and 
decoding the same. 

Background of the Invention 

[0002] Contrary to the frequent predictions that we will 
one day live in a 'paperless society", paper, and other 
printed mediums, are playing an increasingly important 
role as an inexpensive, effective and convenient means 
for communication. Afundamental limitation with paper, 
however, is that from a computer standpoint, it is cur- 
rently an output-only format. While paper may be the 
preferred medium for displaying information for human 
use, it is difficult, if not impossible, for a computer to re- 
cover data reliably once it has been printed. Optical 
character recognition (OCR) attempts to solve this prob- 
lem in a relatively simple domain, such as text rendered 
using standard fonts, but has met with only limited suc- 
cess thus far. While accuracy rates of ninety-nine (99%) 
percent are perhaps achievable and may seem impres- 
sive, a page with 3,000 characters will still incur an av- 
erage of thirty (30) OCR errors and hence requires ex- 
pensive and time consuming manual post-processing. 
[0003] Another approach uses computer readable 
barcodes which may be included directly on paper (or 
other printed medium such as microfilm). Once encod- 
ed, such barcodes can be used by the computer to re- 
cover information evident to the human reader but diffi- 
cult for a computer to recognize (e.g., printed text), in- 
formation implicit to the creation of page but essentially 
invisible to the human reader (e.g., spreadsheet formu- 
las), or any other information desired, whether or not de- 
pendent on the actual character text on the paper. 
[0004] Computer readable barcodes, wherein digital 
data is recorded directly on paper, are known and have 
been utilized to provide document or product identifica- 
tion given a fixed set of values using simple numeric en- 
coding and scanning technologies. Document or prod- 
uct identification systems which have been employed in 
the past include barcode markers and scanners which 
have found use in a wide range of arenas. With respect 
to paper documents, special marks or patterns in the 
paper have been used to provide information to a related 
piece of equipment, for example the job control sheet 
for image processing as taught by Hikawa in U.S. Patent 
No. 5,051 ,779. Similarly, identifying marks comprising 
encoded information have been printed on the face of 
preprinted forms as described in U.S. Patent No. 
5,060,980 to Johnson, et al. The Johnson, et al. system 
provides for a user entering hand drawn information in 



the fields on a paper copy of the form and then scanning 
the form to provide insertions to the fields in the dupli- 
cate form that is stored electronically in the computer. 
Still another system is described in U.S. Patent No. 

s 5,091 ,966 of Bkx>mberg, et al., which teach the decod- 
ing of glyph shape codes, which codes are digitally en- 
coded data on paper. The identifying codes can be read 
by a computer and thereby facilitate computer handling 
of the document, such as identifying, retrieving and 

10 transmitting such document. 

[0005] Besides the various shaped barcodes de- 
scribed above, two-dimensional barcodes called 'data 
strips" having a plurality of rows of "data lines" that rep- 
resent information digitally encoded on printed media 

IS are also known in the art. Each data line row consists of 
a series of black and white pixels each representing bi- 
nary "0"s and "1 's. The ordering of the bits in each row 
determines the digital data stored therein. The data 
stored within the totality of the rows define the data con- 

20 tained in the two-dimensional barcode. Typically, to read 
the barcode, the user passes a hand scanner, which si- 
multaneously reads the information in each data line 
row, vertically along the length of the barcode to read all 
of the data line rows. 

25 [0006] An example of a prior art system using a data 
strip two-dimensional barcode having rows of data lines 
with paper media, is found in U.S. Patent Nos. 
4.692.603, 4.754,127 and 4,782,221 of Brass, et al. In 
this system, two-dimensional barcodes consist of data 

30 line rows which are used to encode computer programs 
and data on paper and are scanned by use of a hand 
scanner. In addition to encoding the computer programs 
and data, these data lines also contain tracking and syn- 
chronization bits, hereinafter referred to as "clock bits". 

35 The requirement for use of numerous clock bits directly 
within each data line row, significantly reduces the 
amount of digital data that can be stored within each row. 
Further, if data line rows having clock bits are damaged, 
which is comnnon if such barcodes are photocopied or 

40 transmitted by facsimile systems, such clock bits would 
be tost making it difficult, if not impossible, to decode the 
information encoded in the barcode. 
[0007] U.S. Patent No. 5,083.214 of Knowles de- 
scribes a two-dimensional barcode system that also re- 

^ quires embedding clock bits within the encoded data it- 
self. However, instead of encoding every row and col- 
umn of data with synchronization and clock bits, the 
Knowles '214 Patent uses the uppermost row in a data 
barcode to provide synchronization during scanning. 

so Further, the first eight left most columns of bits comprise 
clock bits that are used for establishing the clocks to 
read the data bits contained in each row Since the up- 
permost row and all eight left-most columns of the two- 
dimensional barcode, described in the Knowles '214 

ss Patent, are used solely for synchronization and clock- 
ing, the overall data capacity of the barcode is propor- 
tionally decreased. Furthermore, if the uppernnost row 
or left most columns of the barcode are damaged, ckxk 
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information will be lost making it difficult, if not impossi- 
ble, to read out the data encoded in each data row as 
with other prior art systems. 

[0008] U.S. Patent No. 4.924.078 to SanVAnselmo et 
ai. describes a two-dimensional barcode system in s 
which an orientation and/or timing cell border is included 
within the body of the barcode itself. For all cases in 
which the exact location, symlx>l size and data density 
are not known in advance of reading the barcode, the 
border is required by the Sant'Anselmo "OyS Patent and 
must be smooth and on at least two sides of the data 
area. In addition, timing cells are required when certain 
Information is not a priori known, such as the number of 
cells per side. Here again, because the orientation and/ 
or timing cell border is included within the barcode itself, 
the overall data capacity of the barcode is proportionally 
decreased. Furthermore, because critical information 
about decoding the barcode is extracted from the orien- 
tation border, damage to a single row or column of the 
embodiment of the Sant'Anselmo '078 Patent in which 
the border Is present on only two sides also makes it 
difficult, if not impossible to read out the data encoded 
within the barcode. 

[0009] In co-pending patent application "A Clock-Free 
Two-Dlmenslonal Barcode and Method for Printing and 
Reading the Same", (Serial No. 08/569.280. filed De- 
cember 8, 1995) ("the '280 Application") now published 
as EP-A-0783160, the contents of which are explicitly 
incorporated by reference herein, a clock-less two di- 
mensional barcode with a border on at least one of the 
four sides of the barcode Is described, which border Is 
placed outside the confines of the barcode Itself. The 
two-dimensional barcodes are called "PanaMarks"®. 
As depicted in FIG. 1 A, PanaMark 10 is printed in the 
low right hand corner of printed page 11, although this 
position is completely arbitrary. In the embodiment de- 
picted In FIG. 1 A, the remaining portion of printed page 
11 is occupied by printed text 12. However, as one 
skilled in the art will appreciate, any type of computer- 
generated printed material, for example a spreadsheet 
or graphics, can be substituted for the printed text 12. 
The PanaMark 1 0 depicted in FIG. 1 B includes a border 
1 3 that is present on all four of Its sides. Although, as Is 
fully described in the '280 Application, the border 13 Is 
only needed on one of the four sides of the PanaMark 
10, for aesthetic reasons it is typically Included on all 
four sides. 

[0010] It is therefore an object of the present Invention 
to provide a clock free two-dimensional barcode repre- 
senting digitally encoded data. 

[0011] It is an additional object of the present inven- 
tion to provide a border-less clock free two-dimensional 
barcode representing digitally encoded data. 
[0012] Another object of the invention is to provide a 
border-less clock free two-dimensional barcode printed 
on a medium that is sufficiently robust to withstand dam- 
age to the medium. 

[001 3] A further object of the Invent bn is to provide a 



border-less clock free two<iimenslonal barcode repre- 
senting digitally encoded data, without the need for any 
Internal blocks used for alignment purposes. 
[0014] It is yet another object of this invention to en- 
code information formatted In a border-less clock free 
two-dimensional barcode for printing on a printed medi- 
um. 

[0015] An additional object of this Invention Is to pro- 
vide a method of decoding Information digitally encoded 
in the form of a border-less clock free two-dimensional 
barcode printed on a printed medium. 
[0016] Various other objects, advantages and fea- 
tures of the present invention will become readily appar- 
ent from the ensuing detailed description and the novel 
features will be particularly pointed out in the appended 
claims. 

[0017] These and other objectives are realized by a 
two-dimensional data barcode having encoded digital 
information in a bitmap formatted representation of ran- 
domized data bits having a selected pixel row and col- 
umn length. Preferably there is no border surrounding 
the barcode. 

[0018] In one embodiment of this invention, a two-di- 
mensional data barcode has encoded digital information 
in a bitmap formatted representation of randomized data 
bits having a selected pixel row and column length, with 
"black" data pixels in each comer and no border sur- 
rounding the barcode. In another embodiment of this in- 
vention, there is no requirement that 'black' data pixels 
occupy each corner of the barcode. 
[0019] Advantageously each data bit within each 
block in said randomised encoded information is sepa- 
rated from every other data bit within the same block by 
at least one row and one column in said two-dimensional 
barcode. 

[0020] As a further aspect of this invention, a method 
of encoding Information, for printing on a printed medi- 
um, such as paper. Is described. Information is first con- 
verted Into a data stream having a sequence of binary 
characters representing data bits. The data stream is 
then randomized to produce approximately an equal 
number of "0" and "1 " characters. Thereafter, the rand- 
omized data stream is formatted into a two-dimensional 
bitmap of rows and columns and printed onto the printed 
medium. 

[0021] In another aspect of the invention, information 
is converted into a data stream having binary characters 
representing data bits. The data stream is mapped to 
produce digital data bits arranged In a two-dimensional 
bitmap of rows and columns having the four comer lo- 
cations vacant. The four comer locations are set to the 
logical character representing a "black" data pixel. 
[0022] Another aspect of this Invention relates to a 
method for mapping the data stream representing the 
information to be encoded to produce digital data bits 
arranged in a two-dimensional bitmap of rows and col- 
umns such that each data bit of a given block is sepa- 
rated from every other data bit within the same block by 
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at least one row and one column, which ensures that 
the bitmap is robust with respect to row and column 
damage. 

[0023] As a further aspect of this invention, a method 
for ensuring that the information encoded into the two- 
dimensionat barcode is adequately randomized is pre- 
sented. A number of alternate keys are available for ran- 
domization during the encoding process, so that if a first 
key does not produce an adequate level of randomiza- 
tion, a second key can be used. This process of testing 
additional keys continues until the randomization reach- 
es a predefined level. During decoding, the error cor- 
recting code is used to select the most appropriate key 
for derandomization, as the improper key will produce 
excessive errors. 

[0024] Preferably, in all embodiments of the two di- 
mensional barcodes of this invention, the encoded in- 
formation printed on the printed medium is randomized 
to safeguard against strings of contiguous pixels being 
the same color which may occur without randomization. 
[0025] In another embodiment of this invention, infor- 
mation printed on a printed medium in the form of a two- 
dimensional bitmap of rows and columns of data pixels 
representing encoded data bits is decoded. The printed 
medium contains other information in addition to the bit- 
map. The printed medium on which the bitmap is printed 
is scanned, and the bitmap is located within the printed 
medium. The skew angle of the bitmap is determined 
with respect to the origin of the scanned printed medium, 
and if the angle is too large, the bitmap is deskewed. 
Once the bitmap is correctly aligned, it is cropped and 
the binary data from the data pixels is read to produce 
a two-dimensional digital array corresponding to the en- 
coded data bits and corner bits, if present, respectively. 
[0026] In a further embodiment of this aspect of this 
invention, a method of ensuring that the data is properly 
read from the deskewed, cropped bitmap is presented, 
whereby four alternate data streams are created by 
reading the data in four attemate directions, and the er- 
ror correcting code is then used to select the data stream 
with the fewest errors for output. 

Brief Description of the Drawings 

[0027] The following detailed description, given by 
way of example and not intended to limit the present In- 
vention solely thereto, will best be understood in con- 
junction with the accompanying drawings in which: 
[0028] Figure 1 A is a diagram schematically illustrat- 
ing the two-dimensbnal barcode of the '280 Application 
printed on a page of printed text, and Figure IB shows 
an example of the two-dimensional barcode of the '280 
Application. 

[0029] Figure 2 shows an example of a two-dimen- 
sional barcode in accordance with the present invention. 
[0030] Figure 3 Is a flow chart showing the steps for 
encoding and decoding information onto a printed me- 
dium in accordance with the present invention. 



[0031] Figure 4 shows an example of a bad data bit 
layout for the two-dimensional barcode. 
[0032] Figures 5 A and 5B show an example of a better 
data bit layout for the two-dimensional barcode. 
s [0033] Figure 6 shows an example of optimal data bit 
layout for the two-dimensional barcode in accordance 
with the present invention. 

[0034] Figure 7 A shows an example of a randomized 
two-dimensional barcode with "BAD" randomization re- 
10 suits, and Figure 7B shows an example of a randomized 
two-dimensional barcode with "GCX^D" randomization 
results. 

[0035] Figure 8A is a flowchart describing how to read 
one embodiment of the two-dimensional barcode in ac- 
15 cordance with the present invention, and Figure SB is a 
flowchart describing how to read a second embodiment 
of the two-dimensional barcode in accordance with the 
present invention. 

[0036] Figures 9 A and 9B illustrate how the two-di- 
20 mensional barcode is located within the printed page in 
accordance with the present invention. 
[0037] Figures 10A and 10B illustrate how the skew 
of the two-dimensional barcode is calculated in accord- 
ance with one aspect of the present invention. 
25 [0038] Figure 11 illustrates a parametric representa- 
tion of a straight line. 

[0039] Figure 12A and 12B illustrate representations 
of a single point within a plane, and two points within a 
plane, respectively. 
30 [0040] Figures ISA and 13B show the alignment of 
edge pixels in ideal and skewed two-dimensional bar- 
codes of the present invention. 

[0041] Figures 14A and 14B illustrate the four edges 
of a PanaMark segment and the two masks used to de- 
35 tect the edge pixels in the Hough Transform-based skew 
detection procedure in accordance with one aspect of 
the present invention. 

[0042] Figure 1 5A shows a skewed two-dimensional 
barcode and Figure 1 5B shows the edges detected with- 
40 in the two-dimensional barcode of Figure 15A by using 
the Hough Transform In accordance with the present in- 
vention. 

[0043] Figure 16 illustrates the simple deskewing 
process for the two-dimensional barcode in accordance 
45 with of the present invention. 

[0044] Figure 17 shows two ways of applying skew 
correction to the two-dimensional barcode in accord- 
ance with the present invention. 

[0045] Figures 18A-18D illustrate the four alternate di- 
50 rectlons for reading the data from the two-dimensional 
barcode in accordance with the present invention. 

Detailed Description of the Preferred Embodiments 

55 [0046] As fully described In U.S. Patent Nos. 
5,625,721 and 5,703,972 to LoprestI et al. which are 
both entitled 'Certifiable Optical Character Recognition" 
and in the co-pending patent application entitled "A 
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Method and Means For Enhancing Opticai Character 
Recognition of Printed Documents' (Serial No. 
08/138,467. filed October 15. 1993), currently under al- 
lowance, the contents of which are all explicitly incorpo- 
rated by reference herein, infomnation about the con- s 
tents, layout, generation and retrieval of a document can 
be encoded by a computer when Initially generating the 
document or upon subsequent computer processing 
thereof. The encoded document information can then 
be provided via a data barcode, which is also referred 
to herein as a "PanaMark"®, generated on the face of 
a printed version of the document. Advanced encoding 
and print resolution capabilities presently available can 
accommodate up to 30,000 bits of information in a single 
square inch of space. Therefore, as taught by the above- 
referenced applications, one can theoretically encode 
the entire document contents, limited only by the 
amount of space on the document face that one is willing 
to sacrifice to the PanaMark. A barcode scanner, in con- 
junction with or wholly separate from an optical page 
scanner, can scan the PanaMark and provide the infor- 
mation to an associated system equipped with the ap- 
propriate recognition and decoding software. The de- 
coded information can then be used by the scanning 
system to create a new version of the document or to 
enhance the recognition, reproduction and error correc- 
tion for the scanned document. To decode the Pan- 
aMark. it is not required that such barcode scanner and 
scanning system know the printing resolution of the 
PanaMark, provided that the scanning resolution of the 
scanner is at least as fine as the printing resolution of 
the PanaMark, so that each individual 1 x 1 or larger 
pixel of the PanaMark. representing a logical bit, can be 
scanned. 

[0047] The information encoded in the form of a Pan- 
aMark can be used to enhance the software tools al- 
ready used to create paper documents. Examples in- 
clude word processors, spreadsheets, object-oriented 
graphics, and multimedia applications, such as voice re- 
cording and photographic imaging. 
[0048] The border 1 3 used in the PanaMark 1 0 of FIG. 
1 was not a critical feature of the invention disclosed in 
the '280 Application, as most of the key procedures de- 
scribed therein work whether or not a border is present. 
However, the border 10 was used in the '280 Application 
by the skew estimation and deskewing steps. 
[0049] FIG. 2 illustrates an example of the new Pan- 
aMark symbology PanaMark 20 comprises an encoded 
set of data bits in a two-dimensional grid. Typically, each 
data bit which is encoded is printed as a matrix of black 
or white pixels 23. Preferably, a pixel matrix represent- 
ing one data bit is square and may be as small as a 1 x 
1 matrix to as large as a 6 x 6 matrix or more. Non- 
square matrices may also be used. There are no clocks 
or borders needed or required in the symbology for Pan- 
aMark 20. In the preferred embodiment, the PanaMark 
20 is a 20 X 20 array of data bits, although it can be 
recognized that the size is flexible and that the only re- 



quirement on the size is that the reading process know 
the size of the encoded array. In some applications, it 
may be desirable to include a text label 22 adjacent to 
the PanaMari^ 20. although optional and of no signifi- 
cance to either the encoding or decoding process. 
[0050] Two different embodiments are described 
herein. In the first embodiment, the four corner bits 21 
are always black and are necessary for the template 
matching skew estimation process discussed in greater 
detail with respect to FIG. 10 The four corner bits 21 in 
the first embodiment are called "anchor" bits. The re- 
maining data bits in the first embodiment are pseudo- 
randomized and can hold any combination of desired 
information and error correction bits. The symbology of 
the first embodiment provides for good skew estimation 
when the skew is small and the PanaMark 20 is free 
from any damage. However, the placement of the an- 
chor bits 21 in the corner makes them susceptible to 
damage. Thus, in the second embodiment, there is no 
requirement for any anchor bits 21 and the PanaMark 
20 is simply a N x M array of data bits, preferably with 
N = M = 20. In the second embodiment, all of the data 
bits are pseudo-random ized and can hold any combina- 
tion of desired information and error correction bits. 
[0051] Figure 3 illustrates the steps involved in the en- 
coding/decoding process. Except as discussed herein, 
each of the steps are more particularly described in the 
'280 Application. During the encoding process, input da- 
ta in the form of a one-dimensional linear bitstream is 
first processed to add a standard, block-based error cor- 
rection code CECC) at step 30, randomized at step 31 , 
mapped from a one dimensional bitstream to a two-di- 
mensional representation, the PanaMark, at step 32, 
and the PanaMark is finally printed at step 33. The de- 
coding process repeats these steps in reverse order, the 
printed PanaMark is read at step 34. mapped from a 
two-dimensional to one-dimensional representation in 
step 35, derandomized at step 36. and finally the ECC 
is applied at step 37 to recreate the "raw" linear bit- 
stream. As part of the new PanaMark symbology pre- 
sented herein, improved methods for randomizing, 
mapping, reading, and applying the ECC are also pre- 
sented. 

[0052] Any ECC can, of course, be implemented In a 
manner which allows some number of bit errors to be 
corrected. While such errors are generally unavoidable, 
the method by which data is mapped into two dimen- 
sions can be chosen to minimize the impact of certain 
kinds of errors on the readability of the PanaMark as a 
whole. The bits are mapped so that systematic damage 
to the mark is not likely to cause more than one bit error 
In each block, or codeword, which is what the ECC can 
correct. 

[0053] One method, shown in FIG. 4, of placing the 
bits from a single block 40 sequentially in row- or col- 
umn-major order within barcode 41 , is a poor choice. 
When an image is scanned, it is not uncommon to see 
correlated damage along specific rows or columns of 
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pixels, as depicted in row 42. In the case of flatbed scan- 
ners and fax machines, row damage arises as a result 
of physical motion; either the CCD sensor array being 
moved down the page, or the page being drawn past 
the stationary sensors. Any irregularity in this motion will 
lead to distortion across certain rows of scanned image. 
Note that row 42 experiences seven bit errors and hence 
is totally lost. On the other hand, column damage can 
occur when certain of the CCD sensors have gone bad, 
leading to black stripes running down the length of the 
page. 

[0054] As shown in FIGS. 5 A and 5B, a much better 
strategy is to place successive bits from a single block 
along diagonals within barcode 45. In this case, only one 
bit In each block Is affected by damage to correlated row 
46 or column 47. Since the ECC can correct one bit error 
per block, this means that all the blocks are decoded 
successfully. By mapping the blocks so that the damage 
Is distributed uniformly across them, the overall robust- 
ness of the symbology is Improved. 
[0055] It is even more desirable if bits from a single 
block can be placed in non-adjacent rows and columns, 
because the noise effects described earlier could poten- 
tially straddle row or column boundaries. As long as the 
basic block size is less than half the dimension of the 
mark, this Is possible. This is accomplished by the map- 
ping process depicted in FIG. 6. The barcode 50 illus- 
trated in FIG. 6 shows how block zero 51 is mapped on 
the major diagonal of logical PanaMark 50 with each bit 
separated by at least one row and one column. Hence, 
even damage that destroys two adjacent rows or col- 
umns will not, by itself, prevent the correct decoding of 
all of the blocks. Although FIG. 6 depicts a logical Pan- 
aMark 50 with four anchor bits 52, it will be recognized 
that the anchor bits 52 are only necessary when the tem- 
plate matching skew angle detection method Is used, as 
Is discussed herein with respect to FIGS. 10A and 10B. 
[0056] In exchange for doing away with explicit clock 
borders, the reading procedure described in the '280 
Application requires the presence of sufficient edge 
transitions to synthesize the implicit clocks. Large areas 
of all-white or all-black pixels within the PanaMark are 
problematic, because of the consequent reduction In the 
number of edges. In practice, such regions might arise 
due to regular patterns in the underlying data unless 
steps are taken to prevent it. For instance, English text 
represented in standard 8-bit ASCII always has a '0' in 
the high-order bit position. FIGS. 7A and 7B give exam- 
ples of a "bad" PanaMark 60 and a "good" PanaMark 
61. respectively. Both PanaMarks 60 and 61 are shown 
with the optional text label 62. Note that PanaMark 60 
is perfectly valid - it encodes the same number of bits 
as PanaMark 61 . PanaMark 60 would, however, be im- 
possible to read reliably using the techniques presented 
in the '280 Application. 

[0057] Such situations can be prevented from occur- 
ring by randomizing the input bitstream. This means 
that, on the decoding side, the bitstream must be de- 



randomized. This process should actually be called 
"pseudo-randomization" because it is always known 
how to undo its effects. A simple, well-known approach 
to Implement randomization is to perform an exclusive 

5 "OR" ("XOR") operation on the input stream bit-by-bit 
with a predefined, randomly generated key. This has the 
effect of breaking up any regular patterns in the input 
stream, yielding a roughly unifomn distribution of O's and 
Vs. When encoded in the PanaMark, these correspond 

10 to white and black bits, respectively. In this way, the 
"good" PanaMark 61 is created which has sufficient edg- 
es for the implicit clocking necessary to read the data 
bits. The randomization step also gives the mark a non- 
descript, gray appearance that users may find less dis- 

15 tract In g than readily recognizable patterns. 

[0058] During the decoding process, the output bit- 
stream is de-randomized by XOR'ing it with exactly the 
same key. Hence, it is vital that the encoding and de- 
coding procedures use the same key It is possible, in 

20 rare cases, that the randomized bitstream will be degen- 
erate such that it contains large white or black regions 
as illustrated by the "bad" PanaMark 60 of FIG. 7A. To 
avoid this scenario, the encoding procedure can option- 
ally choose from among a small assortment of possible 

25 keys. The selection is guided by the quality (i.e., the ran- 
domness) of the output bitstream. Even though the de- 
coding procedure cannot know a priori which key the 
encoding side has used, it is a simple matter to try each 
key and confirm the results using the output from the 

30 ECC process, Le., the correct key will be the one that 
produces the fewest number of errors from the ECC 
process. 

[0059] FIGS. 8A and 8B are flowcharts illustrating the 
reading procedures for the barcode symbok3gies pre- 

35 sented herein. The method described by FIG. 8A is di- 
rected to the embodiment of the PanaMark in which no 
border or anchor bits are present, while the method de- 
scribed by FIG. 8B is directed to the embodiment of the 
PanaMark which includes anchor bits, but no border. 

40 Printing is handled by a complementary set of routines 
that are more straightforward as no image processing 
is involved, and are fully described In the '280 Applica- 
tion. 

[0060] In FIG. 8A, the first step 70 locates the Pan- 
45 aMark, which has no border or anchor bits, on the print- 
ed page. Under most circumstances, it is assumed that 
the region around the mark is relatively noise-free; ordi- 
narily It Is positioned in an unused corner of the page 
(as, for example, in FIG. 1 A). As more particularly dis- 
50 cussed with respect to FIGS. 9A and 9B, the locate step 
70 operates by first calculating vertical and horizontal 
histograms of the number of black pixels in the scanned 
page. Because the scanned PanaMark is randomized 
and therefore has an approximately equal distribution of 
55 black and white pixels. It will have a greater number of 
black pixels than is normally present within the region 
of the document in which it Is positioned. Each histo- 
gram is sequentially examined, row by row for the ver- 
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tical histogram and column by column for the horizontal 
histogram, until a first predetermined level is reached. 
Next, the process "backs up slightly' until a second low- 
er predetermined level is reached. This is selected as 
the first boundary. Then, moving forward from where the 
first predetermined level was first found, the histogram 
is examined to find the point where it drops below the 
first predetermined level. The process next 'moves for- 
ward slightly' until the second predetermined level is 
reached, similar to the 'backing up" process described 
above. This point Is selected as the second boundary of 
the mark. By performing this process on both the hori- 
zontal and vertical histograms, a box bounding the Pan- 
aMark is selected. The 'backing up slightly" and 'moving 
forward slightly" steps ensure that the corners are not. 
cut off, especially in the case of a skewed PanaMark. 
[0061] In step 71 of FIG. 8A, the skew of the Pan- 
aMark is estimated using the "Hough Transform" proce- 
dure. This procedure is described in greater detail with 
respect to FIGS. 11-13. In this procedure, horizontal 
edges are first found for each row of logical data pixels. 
By using the Hough Transform, a representation for a 
line which is in parallel with the horizontal edges is found 
(which, if skewed, will not be purely horizontal), and the 
angle between this line and a purely horizontal line is 
calculated, giving the skew angle. 
[0062] At step 72, the skew angle is evaluated. If the 
skew angle 0 is below a certain predetermined threshold 
a. no deskewing is required, and the cropping step 75 
is performed next. If the skew angle 4> is greater than the 
predetermined angle ot, then the trigonometric deskew 
step 74 is performed next. As is more particularly de- 
scribed with respect to FIG. 17, once the skew angle is 
known, it is possible to calculate the position of the 
deskewed image mathematically based upon the 
skewed image. 

[0063] Once the deskewing, if any, is completed, at 
step 75 a tight bounding box is cropped around the Pan- 
aMark, which is now oriented with clean horizontal and 
vertical edges. The cropping step is performed in a man- 
ner much like the locate step 70, but with tighter toler- 
ances. Finally, at step 76, the bits are read out, yielding 
an N X M bit array (20 x 20 bit in the preferred embodi- 
ment). The read process 76 is more particularly de- 
scribed in the '280 Application and with respect to FIG. 
18. 

[0064] In FIG. 8B, the first step 70 locates the Pan- 
aMark, which has anchor bits but no border, on the print- 
ed page, in the same manner as is generally described 
above and more particularly described with respect to 
FIGS. 9 A and 9B. As previously mentioned with respect 
to FIG. 8 A. it is assumed that the region around the mark 
is relatively noise-free. 

[0065] In step 71, the skew of the PanaMark is esti- 
mated. Two alternate methods for estimating the skew 
are described herein. The first method is a "template 
matching" procedure which is only effective for small an- 
gles of skew (it is not used in FIG. 8A because it requires 



that the PanaMark to be decoded have anchor bits in 
each corner), which is more parttoularly described with 
respect to FIGS. 10A and 10B. Generally, this process 
operates by aligning two templates closely to two adja- 

5 cent corners of the PanaMark, for example, the left top 
corner and the left bottom corner. Each template forms 
a right angle and preferably has an inner row of black 
bits and two outer rows of white bits. Each template is 
moved bit by bit towards the respective comer, until the 

10 closest possible match is found, aligning the inside row 
of black bits with the respective comer. Although the 
PanaMark may be skewed, each template remains 
aligned vertically and horizontally as it is matched with 
the respective corner. Once the closest match is found 

15 tor each corner, if the PanaMark is skewed, the black 
corner bits of each template will not be aligned vertically. 
The angle between the vertical (or horizontal, if the top 
left and top right comers were selected instead) line ex- 
tending from the comer bit of the first template, and the 

20 line extending from the corner bit of the first template to 
the corner bit of the second template represents the 
skew angle, and can be calculated by simple trigono- 
metric methods. 

[0066] In the second method, the "Hough Transfomi" 

25 is used to calculate the skew angle. This method, which 
is generally described above with respect to FIG . 8 A and 
more particularly described with respect to FIGS. 11 -1 3, 
is effective for a larger range of skew angles (and does 
not require that the PanaMark have anchor bits). The 

30 choice of template matching procedure is a tradeoff be- 
tween the amount of skew to be corrected and the 
processing time allotted, because the Hough Transform 
procedure requires more processing time. If skew an- 
gles greater than can be detected using the template 

35 matching procedure are expected, the Hough Trans- 
form method should be used. If processing time is at a 
premium, and smaller skew angles are expected, the 
template matching procedure should be used. In step 
72, the skew angle is evaluated. If the skew angle ()> is 

40 below a certain predetermined threshold a, no deskew- 
ing is required, and the cropping step 75 is performed 
next. If the skew angle is greater than the predeter- 
mined angle p. then the trigonometric deskew step 74 
is performed next, as is generally described with respect 

45 to FIG. 8A, and more particularly described with respect 
to FIG. 1 7. If the skew angle ^ is between predetermined 
angles a and p, then the simple deskew step 73 is per- 
formed next. The simple deskew method uses a shear 
rotation technique. In this process, one corner bit, for 

50 example, the upper left corner bit is assumed to be cor- 
rectly positioned. Based upon the skew angle, each row 
is sequentially examined to determine the number of 
pixels to be shifted, right to left for example. Each col- 
umn is likewise sequentially examined, to determine the 

55 number of pixels to be shifted, top to bottom for example. 
Next, the pixels are shifted as designated in each row, 
the finally the pixels are shifted as designated in each 
column. 
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[0067] As described with respect to FIG. 8 A. once the 
deskewing, if any, is completed, a tight bounding box is 
cropped around the PanaMark at step 75 and the bits 
are read out at step 76, yielding an N x M bit array (20 
X 20 bit in the preferred embodiment). 
[0068] It is possible, at various steps in the process, 
to check assertions as to whether a valid PanaMark has 
been detected. For example, the PanaMark is expected 
to fall within a certain size range, and to obey specific 
properties with respect to its texture (roughly 50/50 
black and white). In FIGS. 8A and SB, error checking is 
shown with respect to the locate step 70, the crop step 
75 and the read step 76. In later stages, after the read 
step 76, the ECC can also be used to confirm validity. If 
at any point an expected characteristic is not found, 
there are two options: (1 ) back up and try a different ap- 
proach, or (2) abort further processing. Note that it is 
critical for the default steps to be very fast. Only when 
special processing is required should more time be 
granted to read the PanaMark. This is why in FIG. SB 
there are two alternate methods presented for perform- 
ing the deskew step, the trigonometric deskew step 74 
is used only when necessary, and normally the simple 
deskew step 73 is used, which operates more quickly 
than the more complicated trigonometric deskew step 
74. However, as shown in FIG. 8 A, for the PanaMark 
without anchor bits, only the more complex procedures 
for estimating skew angle can be used. 
[0069] FIG. 9 illustrates the two steps that are re- 
quired to carry out the locate step 70. The goal of the 
locate step 70 is to find a bounding box for the PanaMark 
that is assumed to be somewhere in the input image, for 
example, PanaMark 10 shown in FIG.1A in the lower 
right corner of document 11. Normally, the PanaMark is 
the only object present at that particular position on the 
page, other than an optional text label. In the event the 
PanaMark is embedded within the content of the image, 
or other large components are present in the input im- 
age, more sophisticated procedures based upon mor- 
phological operations and statistical analysis must be 
used. Such cases are likely to be rare in practice, how- 
ever. During this process, it must be anticipated that 
there will be a small amount of "speckle" noise and that 
the entire image may be skewed. 
[0070] The locate process illustrated in FIG. 9 is 
based on y-axis and x-axis projections. First, with refer- 
ence to FIG. 9A, the vertical extent (Le.. the top and bot- 
tom boundaries) of the PanaMark 90 is determined. To 
that end, the number of black pixels in each row of the 
input image on which the PanaMark 90 is printed is 
summed, creating a y-axis projection 104. Processing 
proceeds from top to bottom on the y-axis projection 
104. In the embodiment shown, the label is below the 
PanaMark, although as one skilled in the art can under- 
stand, if present, it can be located on any one of the four 
sides of the PanaMark 90. A minium threshold 100 is 
set so that small amounts of spurbus noise are ignored. 
When a row is found that meets the minimum threshold 



100, shown by point 91 , the threshold is reduced slightly 
and the top boundary 1 02 is found by moving backwards 
in the histogram 104 until the reduced threshold is met 
(point 92). This "reducing" of the threshold 100 prevents 

5 the chopping off of the corners of the PanaMark, in the 
event that the mark were skewed and the threshold set 
too tight. Next, processing continues downward, as 
shown in point 93, until the histogram 104 falls below 
the minimum threshold 1 00. At this point, the threshold 

10 is reduced by the same amount as before to establish 
the bottom boundary 103 at point 94, although here the 
processing continues downward, instead of moving 
backwards. 

[0071] Once the correct vertical extent has been de- 
15 termined (effectively eliminating the label), a similar op- 
eration is performed to determine the horizontal extent 
(i.e. , the right and left boundaries) of PanaMark 90. 
Black pixels are summed for each column across the 
currently active rows to create x-axis projection histo- 
gram 101. Processing proceeds from left to right. At 
point 95, the processing stops when the minimum 
threshold level 1 00 is met. The threshold is then reduced 
slightly, and the left boundary 105 is selected by moving 
backwards along histogram 101 until the reduced 
threshold is met at point 96. Then, processing continues 
from point 95, where the minimum threshold level 100 
was found, now looking for a level less than the thresh- 
old 100. Once this is found at point 97, the threshold is 
again reduced by a set amount, and the right boundary 
106 is selected at point 98 by continuing to move left to 
right until the reduced threshold is found. Of course, the 
order of finding the vertical and horizontal boundaries of 
the PanaMark can be reversed. 
[0072] After the locating process is completed, it is as- 
sumed that an accurate bounding box for the PanaMark 
has been determined, which can be confimied in a 
number of ways, e.g. . the ECC process. 
[0073] Once the PanaMark is located within the page, 
it is necessary to estimate its skew angle, if any This is 
required because later processing steps (i.e. . reading) 
depend on the mark being nearly perfectly straight. 
Even though the user may be careful when placing the 
page on the scanning device, small amounts of skew 
can still arise. Once the amount of skew is known, a 
deskewing step can be applied to properly align the Pan- 
aMark. Two altemate methods are presented herein for 
estimating the skew angle, as described with respect to 
block 71 of FIG. 8. The simpler method, which requires 
that the PanaMark include anchor bits, is the template 
matching skew estimation procedure. The template 
matching skew estimation procedure requires much 
less processing time than the more complex Hough 
Transform procedure, but only the Hough Transform 
procedure can be used to calculate the skew angle of 
PanaMarks without anchor bits. 
[0074] FIG. 10 illustrates the template matching pro- 
cedure for skew estimation (step 71 of FIG. 8B), which 
works reliably for small skew angles, such as are en- 
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countered when the user exercises reasonable care in 
placing the page containing the PanaMark 110 (with four 
anchor bits) on the glass of the scanning device, or when 
a scanning device incorporating an automatic sheet 
feeder is used to scan the page containing the Pan- s 
aMark 110. As previously mentioned in conjunction with 
step 71 of FIG. 8B, skew angles too large to be deter- 
mined by the template matching procedure can be cal- 
culated using the Hough Transform technique to be de- 
scribed in conjunction with FIGS. 11-12. 
[0075] Referring to FIGS. 10A and 10B. templates 
111, 112 are defined for the northwest (N W) and south- 
west (SW) corners of the PanaMark 110, respectively. 
Each template 111, 112 preferably includes one row of 
black pixels 117 and two rows of white pixels 113. As 
shown in the left poitbn of FIG. 10, each template 111, 
112 Is moved across the appropriate subregion of Pan- 
aMark 110 in a bitwise manner to locate the corners. 
This process is indicated by arrows 118 showing the 
movement of template 111 toward the northwest corner, 
and by arrows 119 showing the movement of template 
1 1 2 toward the southwest comer. At each point of move- 
ment, a 2-D Hamming distance is computed as is well 
known in the art (I.e.. the number of pixel locations in 
which the template and the image do not match), and 
each template 111, 112 is adjusted back and forth until 
this distance is minimized for each corner. 
[0076] Once the templates 111, 112 are in their best- 
fit positions, the angle 114 between the corner pixels 
115, 116 in the templates 111. 112 is determined as 
shown in FIG. 1 0B in a manner well-known in the art. 
thereby yielding an estimate for the skew angle 4> of the 
PanaMark 110. 

[0077] Note that while the anchor bits in the PanaMark 
110 may be skewed, the templates 111, 112 are not (L 
e^, the templates are pre-defined, and the skew angle is 
not a pr/on known). For this reason the procedure is re- 
liable only for small skew angles. The skew over a single 
bit is relatively minor compared to the skew over the en- 
tire dimension of the PanaMark 110. Hence, the tem- 
plates 111,112 will match well in their intended positions 
so long as the anchor bits in the image are not signifi- 
cantly distorted {le., the skew angle <t) is not too large). 
[0078] The Hough Transformation can also be used 
to determine the skew angle for levels of skew angle 
which the template matching procedure is incapable of 
handling and for PanaMarks without anchor bits. The 
Hough Transformation is one of the oldest and most ro- 
bust methods used in image analysis and computer vi- 
sion. It is a technique for detecting geometric features 
such as straight lines, circles, etc., in a digital image. 
The Hough Transformation is a parametric transforma- 
tion which maps a set of points of a shape into a small 
set of parameters characteristic of that shape. For ex- 
ample, as shown in FIG. 11 , points on a straight line 120 
can be mapped to a pair of parameters (p, 6) by its par- 
ametric equation: 



p = >rcos(8) + ysin(8) 

where p. 1 21 . is the normal distance to the line from the 
origin, and 8, 1 22, is the angle of the normal of the line 
with the X axis. 

[0079] The above correspondence between all points 
on a straight line and a parameter pair (p, 6) assumes 
that (p. 9) of the straight line is known. As shown In FIG. 
12A. an arbitrary point (x, y) 127 in a plane, however, 
can be on an inrinite number of straight lines, as depict- 
ed by lines 1 23a-1 23d. In other words, a point (x, >0 1 27 
can be mapped to an infinite number of parameters (p. 

8) . The set F^y - {(p, 8) I 0 < 8 < 180^ p= x cos{Q) + y 
s/n(0)} represents all straight lines passing through (x. 
y)127. 

[0080] On the other hand, as shown in FIG. 12B. for 
any two points (x^^ Yq) 128 and (x,, y^) 129 there is a 
straight line 124 passing through both. Accordingly, 
there exists one pair of parameters (p*. 8*) common to 
both set Fjrtjj^and set F^^j^p such that: 

[0081] Subsequently, if many points (Xf, y^) in a plane 
form a straight line of parameters (p*, 8*). the parame- 
ters shall be a member of all corresponding sets F ^yf. 
In other words, given a set of points in a plane, the 
Hough Transformation can be used to detect the exist- 
ence of a straight line of given parameters by calculating 
the frequency that the parameters appear in set F„'yi 
over all points. 

[0082] In practice, the line detection algorithm based 
upon the Hough Transform requires a 2-D accumulator 
array representing the quantized parameter space (p, 
6). To detect the lines, the procedure examines every 
foreground pixel (x. y) in the image, and calculates all 
quantized values (p, 8) In the set F^^y The resulting (p. 

9) entries in the accumulator array are incremented. 
[0083] After all pixels have been processed, the ac- 
cumulator array is searched for high value peaks. These 
peaks indicate the parameters of the most likely straight 
lines in the image. The highest peak represents the pa- 
rameters of the "dominant" straight line in the image. 
[0084] As shown in FIG. 1 3A, when scanned perfect- 
ly, the edge pixels of a PanaMark 1 30 corresponding to 
the logical bits in a same row in a PanaMark 130 reside 
on the same scanline 132. As shown in FIG. 13B, when 
the PanaMark 131 Is skewed, these edge pixels align 
roughly in the direction of skew, and thus the edge pixels 
form straight lines, such as parallel scanline 133 and 
perpendicular scanline 134 with respect to the orienta- 
tion of PanaMark 1 31 . Thus, the Hough Transform pro- 
cedure can be used to identify these straight lines, such 
as scanlines 133 and 134, and subsequently, deduce 
the skew angle of PanaMark 1 31 . 

[0085] The first step of the skew detection procedure 
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based on the Hough Transform is to identify the edge 
pixels of within a PanaMark. The blacic and white pat- 
terns within a PanaMark have four sides (called 'edges' 
herein). Figure 14A shows the four edges for a black 
pattern 140, which are aligned in two orthogonal orien- 
tatkxis, i.e., horizontal edges 141 and vertical edges 

1 42. For the purpose of skew estimation, it is sufficient 
to detect edge pixels in only one orientation, for exam- 
ple, only horizontal edges 141. 

[0086] As shown in FIG. 14B, two edge masks 143. 
146 are applied to a PanaMark pattern to identify the 
horizontal edge pixels. Although one edge mask would 
be adequate to perform skew estimation, in the pre- 
ferred embodiment disclosed herein two edge masks 

143, 146 are used to provide robustness to the identifi- 
cation procedure. Any black pixel locations whose black 
and white pattern in vertical position matches either of 
the masks are considered edge pixels. The width of the 
mask 1/V 145, 146 is preferably set to be one-half of the 
number of pixels of a PanaMark logical bit in the 
scanned image. 

[0087] FIG. 1 5A shows a scanned PanaMark 1 50 with 
a skew angle of 10° and FIG. 15B shows the horizontal 
edge pixels 151 detected in PanaMark 150. Once the 
edge pixels 151 are identified, the Hough Transform pro- 
cedure is applied to detect the "dominant" straight line 
in edge image 152. The skew of the PanaMark 150 is 
then calculated as the skew of the dominant line in edge 
image 1 52. 

[0088] After the skew angle has been estimated, us- 
ing either of the procedures just described, it may nec- 
essary to apply a corrective deskewing step. As with the 
other steps involved in reading a PanaMark, there are 
a number of different ways to accomplish this. 
[0089] As shown in FIG. 16, a 'shear rotation" proce- 
dure can be used to deskew the PanaMark for relatively 
small levels of skew. This is a one-dimensional approx- 
imation to a true, two-dimensional trigonometric rotation 
which would be more costly in terms of computing time 
to compute. Shear rotation works well for the kinds of 
small skew angles usually expected. If larger values are 
detected, the more complex and hence more time con- 
suming process described with respect to FIG. 17 must 
be used. Two separate one-dimensional steps are per- 
formed which amount to sliding the appropriate rows 
and columns of the PanaMark: a horizontal shear fol- 
lowed by a vertical shear. 

[0090] The shear rotation process begins by assum- 
ing that the upper-left-most pixel 163 of the PanaMark 
is correctly placed. Based on the skew estimate deter- 
mined earlier, a compensating number of pixels to shift 
each successive row is computed, working down from 
the top. Likewise, a compensating number of pixels to 
shift each successive column is computed, again as- 
suming the upper-left-most pixel is correctly placed. The 
rotation is then performed by shifting first the rows, then 
the columns, as depicted in FIGS. 16A and 16B. Note 
that FIGS. 16A and 16B combine the separate steps of 



calculating the shift with the actual rotation. The first 
step, as shown in the horizontal shear step, is to calcu- 
late working from the top the number of compensating 
pixels to be shifted for each row of image 1 60. As indi- 
s cated to the left of innage 160 in FIG. 16A, the top four 
rows require no shift, the next two rows require a one 
pixel shift, and the two bottom-most rows require a two 
pixel shift. The next step, as shown in the vertical shear 
step, is to calculate working from the left the number of 
10 compensating pixels to be shifted for each column of the 
original image 160. As indicated below image 161 of 
FIG. 1 6B, the first two columns require no shift, the next 
two columns require a one pixel shift, and the final two 
columns require a two pixel shift. Next, the horizontal 
15 shear process is performed on image 1 60 by shifting the 
lower four rows of pixels. Finally, the vertical shear proc- 
ess is performed by shifting the right four columns of 
pixels by the calculated amounts, giving the final result 
shown in FIG. 16C. The actual PanaMark is only a 6 x 
6 bit array, and the outer white regions shown in Pan- 
aMark 162 will be eliminated during the cropping step 
75 depicted in FIG. 8 and discussed below. 
[0091] The example used in FIG. 16 was chosen to 
illustrate the two shearing operations, but the coarse 
resolution of the bitmap and the large degree of skew 
result in a misleading amount of distortion in the output 
image, which does not occur in practice. 
[0092] As shown in FIG. 1 7, to implement the trigono- 
metric deskewing step 74 described in FIG. 8. the rela- 
tionship between a two-dimensional image 171 and its 
skewed (or rotated) counterpart 170 can be described 
as an affined transformation of coordinates by the well- 
known equations: 

V = xcose - ysine (1) 



y = xsine + ycose (2) 

where (x. y) and (x',y) are the pixel coordinates in the 
original image 171 and the skewed image 170, respec- 
tively, and e is the skew angle. 

[0093] Consequently, once 6 is known, the original im- 
age 171 can be recovered from its skewed version 170 
by the inverse transformation: 

X = VcosO + y sine (3) 



y = - y sine + ycose (4) 

[0094] A straightforward way of implementing skew 
correction is to compute the deskewed image 171 coor- 
dinate (x, >) for each black pixel {x\y) in the input image 
170 using the inverse transformation (equations (3) and 
(4)), as shown by the dotted line 173 in FIG. 17. Each 
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pixel (x, y) is then set to the be the color of pixel (x\ y). 
[0095] Alternatively, the operation can be done in the 
opposite way: for every pixel (x, y) in the goal image, its 
position in the input image (x'y) is computed via equa- 
tions (1 ) and (2), as shown by the dotted line 1 72 of FIG. 
17. The pixel color is then copied from the computed 
location to (x, y). 

[0096] From the mathematical point of view, the two 
implementations are equivalent. For digital images, 
however, they are not identical. Both methods have their 
shortcomings. Due to numerical round-ups. the first im- 
plementation may generate small holes In thedeskewed 
image, while the second implementation may result in 
misplaced edges. Although each method may provide 
satisfactory results, the second method is the preferred 
method. 

[0097] After the PanaMark has been deskewed, the 
bounding box calculated with respect to FIG. 9 sur- 
rounding the PanaMark may be too "loose", Le., there 
are more pixels within the bounding box than is needed 
to extract the data bits. For example, as shown in FIGS. 
16A-16C. if the original bitmap measures 8x7 pixels 
and the final deskewed bitmap measures 6x6 pixels, 
two columns and one row of pixels must be discarded. 
Hence, another set of operations much like the original 
locate step described with respect to FIG. 9 is per- 
formed, with slightly tighter thresholds since it is no long- 
er necessary to worry about chopping off the comers of 
the PanaMark. 

[0098] At this point, the scanned PanaMark has been 
located, deskewed, and tightly cropped. The next step 
is to read out the data bits, which transforms the Pan- 
aMark from the image domain, where each bit is repre- 
sented as a collection of black or white pixels, to a 20 x 
20 bit array, in the preferred embodiment, of logical val- 
ues. Note that since the PanaMark symbology is clock- 
free, there are no predetermined reference patterns to 
help orient the reading process. However, the logical 
size of the PanaMark is known in advance, for example, 
a square measuring 20 bits on each side in the preferred 
embodiment. Moreover, because the bits in the mark are 
pseudo-randomized during the encoding process, any 
particular row or column of pixels will show a higher dis- 
tribution of black-white and white-black transitions near 
the edges in the logical rows and columns, and a lower 
distribution near the centers. This process is fully de- 
scribed in the '280 Application. Once the horizontal and 
vertical center lines have been established by the proc- 
ess described in the '280 Application, the bits are read 
out of the PanaMark by recording the pixel value lying 
at the intersection of each horizontal and vertical center 
line (for example, setting each "white" pixel value = "0" 
and each "black" pixel value = "1 "). 
[0099] As shown in Figure 18A, the basic clocking 
process corresponds to starting the N W (upper-left) cor- 
ner of PanaMark 184 and reading out the data in a left- 
to-right, top-to-bottom fashion. As shown in FIGS. 18C- 
18D, the same paradigm can be applied, though, begin- 



ning at any of the other three comers. (SW. NE. SE) as 
well. In theory, the output from these four alternate 
clockings should be the same. However, because of 
scanning variations and round-off errors, the results can 

s in fact be different. However, based on feedback from 
the ECC step, there is a chobe of using the best of the 
four different readings, made in the four alternate direc- 
tions described in FIGS. 18A-18D. In other words, if the 
ECC reports two errors detected during the left-to-right. 

TO top-to-bottom clocking, while the other clocking alt re- 
turn 1 5 errors, then the first set of results should be used 
and the remainder discarded. This has the effect of re- 
ducing the overall error rate with high probability. 
[0100] While the present invention has been particu- 

is larly shown and described with reference to the pre- 
ferred embodiments and varbus aspects thereof, it will 
be appreciated by those of ordinary skill in the art that 
various changes and modifications may be made with- 
out departing from the spirit and scope of the invention. 

20 It is intended that the appended claims be interpreted 
as including the embodiments described herein, the al- 
ternatives mentioned above, and all equivalents thereto. 



2S Claims 

1. A method of encoding information for printing on a 
printed medium, comprising the steps of: 

30 converting said information into a data stream 

having a sequence of "O" and "1" characters 
representing data bits; 

randomizing said data stream to produce a ran- 
domized data stream having a sequence of "O" 
35 and " 1 " characte rs; 

formatting the randomized data stream into a 
two-dimensional bitmap of rows and columns 
including only the randomized data stream bits; 
and 

40 printing the formatted bitmap onto said printed 

medium. 

2. The method as in Claim 1 , wherein said formatting 
step includes the addition of an anchor bit into at 

45 least one corner of said two-dimensional bitmap. 

3. The method as in Claim 1 or 2, wherein said format- 
ting step includes the addition of an anchor bit into 
each corner of sakl two-dimensional bitmap. 

50 

4. The method as in any one of Claims 1 -3, wherein 
said formatting step includes the step of mapping 
the randomized data stream such that each data bit 
within each block in said data stream is separated 

55 from every other data bit within the same block by 
at least one row and one colurnn in said two-dimen- 
sional bitmap. 



11 



21 

5. The method as in any one of Claims 1-4, wherein 
said randomizing step comprises the step of repeat- 
edly randomizing said data stream by alternate des- 
ignated keys until the randomized data stream con- 
tains a substantially uniform distribution of "O" and s 
■1" characters. 

6. The method as in any one of Claims 1-5, further 
comprising the step of compressing said informa- 
tion, 10 

7. The method as in any one of Claims 1-6. further 
comprising the step of adding error correction code 
data bits to said data stream. 

IS 

8. The method as in any one of Claims 1-7. further 
comprising the step of duplicating said data stream 
to produce redundant data for improving reliability 
upon encoding. 

20 

9. An information storage medium comprising a two- 
dimensional data barcode printed on a printed me- 
dium comprising a bitmap formatted representation 
of randomized encoded information having a prese- 
lected bit row and column length. 25 

10. A method of decoding randomized information 
printed on a printed medium in the form of a bitmap 
of rows and columns of data pixels representing en- 
coded data bits, each of said data pixels being ei- 30 
ther a first or second color, comprising the steps of: 



22 

in addition to said bitmap and said method includes 
the steps of formatting said bitmap and said other 
information to a pixel based grayscale representa- 
tion and determining the row and column bounda- 
ries by sequentially comparing the number of first 
pixels in each row and column respectively to a pre- 
determined threshold. 

12. The method according to Claim 10 or 11, wherein 
said skew angle determining step uses either a tem- 
plate matching procedure or a Hough-transform 
procedure. 

13. The method according to any one of Claims 10-12. 
wherein said deskewing step uses either a shear 
rotation procedure or a trigonometric procedure. 

14. The method according to any one of Claims 10-13, 
wherein said reading step reads said binary data in 
each of four alternate directions creating four alter- 
nate one-dimensional arrays of digital data, and 
said error correcting step selects the one-dimen- 
sional arrays of digital data with the least number of 
errors for output. 

15. The method according to any one of Claims 10-14, 
wherein said derandomizing step repeatedly deran- 
domizes the or each said one-dimensional array of 
digital data by alternate designated keys until the 
errors detected by said error correcting step fall be- 
low a preset threshold. 

16. The method according to any one of Claims 10-15, 
wherein said reading step comprises the steps of: 

determining the number of horizontal and ver- 
tical edges in each respective row and column 
of said bitmap; 

determining the row center lines from the local 
minima in the horizontal edge counts; 
determining the column center lines form the lo- 
cal minima in the vertical edge counts; 
and reading out binary data located at the in- 
tersection of each row center line and column 
center line in a predetermined pattern to pro- 
duce a one-dimensional digital representation 
of said digitized bitmap. 



scanning said printed medium to digitize said 
bitmap; 

fomiatting said bitmap to a pixel based gray- 3S 
scale representation; 

setting the threshold intensity level based on 
said grayscale representation such that pixels 
above said threshold correspond to said first 
color and pixels below said threshold corre- 40 
spond to said second color; 
determining the skew angle of said digitized bit- 
map; 

deskewing said digitized bitmap so that said 
skew angle is reduced to substantially zero; ^ 
cropping said digitized bitmap; 
reading out binary data from said digitized bit- 
map in a predetermined pattern to produce a 
one-dimensional array of digital data; 
derandomizing said one-dimensional array of so 
digital data; and 

error-correcting the derandomized one -dimen- 
sional array of digital data to produce a sub- 
stantially error-free digital representation of the 
encoded information. 55 
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11. A method according to Claim 1 0 wherein said print- 
ed medium has other information recorded thereon 
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FIG. 8B 
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